Ultra-Low-Power Voice Activity Detection System Using Level-Crossing Sampling
نویسندگان
چکیده
This paper presents an ultra-low-power voice activity detection (VAD) system to discriminate speech from non-speech parts of audio signals. The proposed VAD uses level-crossing sampling for detection. useless samples in the signal are eliminated due activity-dependent nature this scheme. A 40 ms moving window with a 30 overlap is exploited as feature extraction block, within which output analog-to-digital converter (LC-ADC) counted feature. only variable used distinguish and segments input number LC-ADC time window. achieves average 91.02% hit rate 82.64% over 12 noise types at ?5, 0, 5, 10 dB signal-to-noise ratios (SNR) TIMIT database. including LC-ADC, extraction, classification circuits was designed 0.18 µm CMOS technology. Post-layout simulation results show power consumption 394.6 nW silicon area 0.044 mm2, makes it suitable always-on device automatic recognition system.
منابع مشابه
Zero-Crossing-Based Ultra-Low-Power A/D Converters Citation
| Since the first demonstration of a comparatorbased switched-capacitor circuit, analog-to-digital (A/D) converters based on virtual ground detection have made steady and significant progress. Comparators have been replaced by zero-crossing detectors, leading to the development of zerocrossing based circuits for faster speed and lower power. All facets of performance including the sampling rate...
متن کاملNoise-robust hands-free voice activity detection with adaptive zero crossing detection using talker direction estimation
This paper proposes a novel hands-free voice activity detection (VAD) method utilizing not only temporal features but also spatial features, called adaptive zero crossing detection (AZCD), that uses talker direction estimation. It firstly estimates talker direction to extract two spatial features: spatial reliability and spatial variance, based on weighted cross-power spectrum phase analysis an...
متن کاملLow frequency ultrasonic voice activity detection using convolutional neural networks
Low frequency ultrasonic mouth state detection uses reflected audio chirps from the face in the region of the mouth to determine lip state, whether open, closed or partially open. The chirps are located in a frequency range just above the threshold of human hearing and are thus both inaudible as well as unaffected by interfering speech, yet can be produced and sensed using inexpensive equipment...
متن کاملSmart Ultra Low Power Energy Harvesting System
Small embedded systems operating in unattended conditions do need to be perpetually powered if a truly pervasive paradigm is envisaged. Harvesting energy from the surrounding environment seems to be the best option. For that, a set of systems has been proposed featuring interesting solutions but not yet capable of overcoming some issues like performance and flexibility. The authors propose a no...
متن کاملProcessing of Non-Stationary Signal Using Level-Crossing Sampling
The spectral characteristics of multimedia signals typically vary with time. Preferably, the sampling density of them would comply with instantaneous bandwidth of signal. The paper discusses the level-crossing sampling principle, which provides such capability for analog-to-digital conversion. As the captured samples are spaced non-uniformly, the appropriate digital signal processing is require...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronics
سال: 2023
ISSN: ['2079-9292']
DOI: https://doi.org/10.3390/electronics12040795